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Abstract 

A theory which describes the share price evolution at financial markets as a continuous- 
time random walk Scalas (2000); Mainardi (2000); Sabatelli (2002); Raberto (2002) 
has been generalized in order to take into account the dependence of waiting times 
t on price returns x. A joint probability density function (pdf) 4>x,T(x,t) which 
uses the concept of a Levy stable distribution is worked out. The theory is fitted to 
high-frequency US $/ Japanese Yen exchange rate and low-frequency 19th century 
Irish stock data. The theory has been fitted both to price return and to waiting time 
data and the adherence to data, in terms of the x 2 test statistic, has been improved 
when compared to the old theory Sabatelli (2002). 
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1 Introduction 



The id ea to describe th e stock market share prices as a random walk dates 
back to Bacheliei (1900). He assumed that price returns x and waiting times t 



are independent, identically distributed (i.i.d.) random variables which in the 
long time limit, according to the Central Limit Theorem (CLT), conform to 
the normal distribution. 



The p roperties of random walks are well understood [see for example MontrolJ 



(1965)] and since low-frequency market data conforms fairly well to a normal 
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distribution the concept of a Continuous Time Random Walk (CTRW) has 
been incorp o rated into standard economical textbooks. This approach enabled 
Black et all (|l973h to formulate a theory of pricing financial derivatives for 



which they received the Nobel price in economics. 

However there are serious problems with CTRWs. The observed probability 
density function (pdf) of pri c e returns 4>(x) shows c onsider a ble d iscrepancies 
from a Gaussian |Luxi (1996); IGopikrishnanI (|l999yPlero"u1 (Il999h l In partic- 



ular it exhib its power law tails for large values of x. iMandelbrotl ((1963) and 
Fama ( 1970j ) managed to generalize Bachelier's approach by introducing Levy 



distributions L Mj( g(x) with the normal distribution being only a particular rep- 
resentative of these. 

Levy distributions, functions labeled by two real parameters a and /3, are 
limit distributions for s ums of independ ent random variables which emerge 
in the generalized CLT iBouchaudl (|l990h . Taking (3 = and < a < 2 en- 



sures that the functions are both positive, even (L^p^—x) = L^p(x)), behave 
like a steep Gaussian for small values of x and decay according to a Pareto's 
power law much slower than a Gaussian for large x's. Despite technical prob- 
lems, the second moment being infinite for a < 2, these functions have been 
succes sfully used to m odel relaxation processes (die-el ectrical, mec h anical and 
NMR) iMontrollI (|l984h and in the theory of probability [Levvl f|l937h ; iGnedenkol 



Recently, an investigation of about 40 million price quotes at the New York 
Stock Exchange (NYSE) IGopikrishnanI (|l998| ) showed that the cumulative 
probability density of price returns S(x) = P(X > x) behaves like S(x) ~ x~^ 
with fi = 3.1±0.003 which yields a value of \i beyond the valid range < /i < 2 
if the pdf were to be modeled by a Levy distribution. Levy distributions are 
well defined for \x > 2 but can take negative values in that regime hence are 
not good candidates for pdfs. 

Furthermore in most treatments so far it is assumed that price retu rns x are 
i ndepe ndent of waiting times t. This does not seem to be the case Rabertol 



(2002) and it also seems counterintuitive. One would expect larger price vari 



ations to correspond to larger waiting times since one would expect it to be 
more difficult, take more time, for a broker to find buyers and sellers that 
balance out supply and demand if the price variation is larger than if it was 
smaller. Such simplified reasoning implies that there should be a positive cor- 
relation between x's and fs which has not been taken into account in many 
works on this subject. 

In this paper we will generalize the theory to take account of these two diffi- 
culties. We will drop the assumption that x's and i's are independent random 
variables but still retain the property of independence of variables correspond- 
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ing to different times. By assuming that it is a certain function of x and t that 
conforms asymptotically to the Levy distribution rather than either price re- 
turns or waiting times themselves we will work out the joint pdf </>x,t(^,0- 
Theoretical results will be compared to some high-frequency market data. 

The paper is organized as follows. In section 2.1 we formulate our theory 
and work out analytical formulas for pdfs and cumulative pdfs under various 
assumptions about how price returns and waiting times are correlated. In sec- 
tion 2.2 we set up an equation for the probability p(x,t) of having price x 
at time t and show that price evolution does not satisfy the Markovian prop- 
erty. We co mpare our evolut ion equation to that worked out under different 
assumptions Sa batelli (|2002h . Finally in sections 3.1, 3.2 and 3.3 we fit the 



cumulative pdfs of high-frequency exchange rate and low-frequency 19th cen- 
tury stock data to our theory, discuss the goodness of fit and assess whether 
the correlations between x's and t's which follow from our theory conform to 
market data. 



2 A random walk with non-independent price returns and waiting 
times. 



2.1 Pdfs and cumulative pdfs. 



In the following we will adhere to the convention that random variables are 
denoted by capital letters and values of random variables by lower-case letters. 
The objective is to work out the joint pdf of price returns and waiting times 
(px,T{x,t) under the assumption that the random variables X and T are not 
independent. The first idea which comes into a mind of a physicist is to assume 
a weak dependence and to develop a perturbative approach. 

One could for example consider the correlation coefficient between X and T: 

= MS (!) 

/E[X 2 ] - E[X} 2 Je[T 2 } — E[T} 2 



with E[f] being the mean of / and minimize it as a functional of the joint pdf 
4>x,t{x, t) subject to an assumption that the pdf belongs to a certain functional 
space, ie is a linear combination of certain basis functions. However, since it 
is rather unclear how to choose these functions we will develop a different 
approach. 

We assume that it is not the price return x or the waiting time t that conforms 
to a Levy distribution but that a certain function f t = f(x(t),t) has that 
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property. This means that there exists a random variable Ft, which depends 
on the random price returns X and waiting times T, the change of which in 
the time period \T 1 ,T N ] takes, for large N, the following form: 



N 



where Levy(o;, (3) is a random variable conforming to a Levy distribution with 
parameters a and (5. Equation (2) is a statement of the Generalized CLT under 
the assumption that the increments Ft 1+1 — Ft { are i.i.d random variables. 

— * 

We also assume that instead of one stock there can be a set of stocks X = 
{X\, . . . ,X p } the prices of which are correlated with each other and so Ft 
depends on all stock price returns X, ie Ft = F(X,T), and Ft conforms to 
the Levy distribution (2). 

In order to work out the distribution (px,T(x,t) of the returns and waiting 
times we consider a transformation of variables in the p + 1-dimensional space 
with Cartesian coordinates x±, . . . ,x p ,t: 



(x 1 ,x 2 , ...,x p ,t) — ► (/, 0i, . . . , 9 P ) (3) 



We require the elementary probabilities P () to be conserved: 



1 f<F<f + df 



9 i <e i <9 i + dO t 



:= <j)(x,t)dxdt{A) 



where Qi(9) for i — 1, . . . ,p are certain functions of the variables 9{ which we 
set equal to one 0, = 1. 

Now the problem is to make a proper choice of the function / = f(x,t). We 
will assume that / is a hypotenuse in a right triangle with edges of lengths 



\x\ and t (see Fig.l). Therefore / = yjx\ + . . . + x 2 v + t 2 and the most natural 
transformation (3) that comes into mind here is a mapping into spherical polar 
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x, 



Fig. 1. Illustration of our assumption concerning the random walk. Here the changes 
in lengths of consecutive hypotenuses(marked with dashed lines) are i.i.d. random 
variables. Therefore, on the grounds of the generalized CLT, the length of the hy- 
potenuse conforms, in large time limit, to a Levy distribution (compare (2)). 



coordinates Arfken 



X\ = f sin 9 P sin p _i ... sin 2 sin 0\ 
xi — j sin Op sin Q p -\ . . . sin 2 cos 0\ 
x 3 = f sin P sin Q p -\ . . . cos O2 



(5) 



X I 



f sin P cos p -i 
f cos Op 



where < P < it and < Oj < 2ir for j 
transformation (5) reads: 



1. The Jacobian of the 



J 



(9(xi , . . . , Xp, t) 



d(f,0 



1: 



f p sin 2 sin 2 3 • • ■ sin^" 1 p = f ■ M (x) 



(6) 



where 



M(x) 



\ 



n(E 

g =2 s=l 



(7) 



and the joint pdf of price returns and waiting times reads: 
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<j> j Z T (x,t)=L li0 (f) ■ 



d(f,0 1 ,...,0 p ) 



d(xi, ...,x p ,t) 



/ M(x) 



x\ 2 + t 2 M{x) 



(8) 



Integrating over all x values with a fixed modulus x we obtain the pdf (f)x,r(x, t) 
which corresponds to the length of the vector of price returns x — \x\ and to 
the waiting time t : 



<Px,T(x,t) = J <f)^ T (x,t) 



d{x\, ...,x p 



d& . . . dC P -i = - 

7T 



2L ft0 (v^ + ?) 
Vx 2 + t 2 



(9) 



In equation (9) we applied a mapping into spherical polar coordinates in p- 
dimensional space (xi, . . . ,x p ) — > (x, £i, . . . , £p_i) the Jacobian of which, 
according to equation (6), reads: 



d{x\ , . . . , Xp) 





/p-1 <? \ 


= X • 

\ 


n(E^) 


\q=2 s=l J 



x ■ ^x 2 - x 2 ■ yx 2 -x 2 - x 2 p _ x ■ . . . ■ \jx\ + x\ = M{x) (10) 



Finally we work out the cumulative density of waiting times 
S T (t) = P{T > t) and that of price returns S x {x) = P(X > x): 



S T (t) = J Mm = lj j / , + + J ] dridi (11) 

Now, the double integral on the right-hand-side in (11) is computed by going 
to polar coordinates rj = z cos and £ = zsin0. The cumulative density Sx(t) 
reads: 



oo . . t/2 oo 

S T (t) = - f dzz^± j d<P=- I dzL^(z) • (tt/2 - O ) 

7T J Z J 71 J 

t (f> t 

2 °° t 

= — j arccos (^-^L^(z)dz (12) 
t 

where O — arcsin (t/z). Since the marginal pdfs 4>x{x) and take the 

same form, it is readily seen that the joint pdf in (9) does not change when 
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x and t are mutually exchanged, the cumulative density of returns Sx(x) is 
given by the same formula as (12) except that t is replaced by x. Let us also 
write down the large t expansion of the cumulative density. 

*(«> = ^ E \dTy^ + M sin(^/2) B (5, »—) - ( 13 ) 



where £>(n, m) = T(n)T(m)/T( y n+m) is the beta function. The expansion (13) 
evaluated by making use of an asymptotic expansion of the Levy function and 
may be useful for numerical calculations of Sr(t) for large values of t. 

It is interesting to compare these results with a different assumption for the 
function /. Assume that it is the sum of moduli of price returns and the 
waiting time that conforms to the Levy distribution / = \xi\ + . . . + \x p \ + 1. 
Looking at Fig.l one could say that it is the sum of neighboring horizontal and 
vertical sections \t n \ + \x n \ in one knee of the random staircase that conforms 
to the Levy distribution in large time limit. The transformation of variables 
(3) is now given by: 

xi = 6J 

X2 = 02/ 

; =; (14) 

Xp dpf 

t = f\\-e l -...-e p ) 



where < 9i < 1 and < 1- The Jacobian reads (— l) p f p and the joint 

pdf takes the form (f)g T (x,t) = L Mj0 (/)// p - The joint pdf (f>x,T(x,t) which 
depends on the modulus of the x vector in the L\ norm x = Yh=i x i reads: 



(px,T(x,t)= J 4>z tT {x, t)dx = L ^ ^ y - J dx = 

x % =x Xj=x 

_ x p ~ l L^ fl (x + t) 

~ (p - l)\(x + t)P [ ' 

The cumulative probability distribution of waiting times Srif) reads: 



(p-iy.(x + zy p\ t 
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Function 


Joint pat 




Kernels 


Asymptotic behavior 






of cumulative pdfs 


of cumulative pdfs 






in time 


in price returns 


in time 


in price returns 


f = f(x,t) 


4>X,T(x,t) 


K T (t,0 


K x (x,0 






Vx 2 + t 2 
x + t 


f 

P 


arccos | 


arccos | 


C/tP 
D/P 





Table 1 



Probability distribution functions in a CTRW with non-independent increments. 
The kernels Kx(x,£) and KT(t,£) determine the cumulative densities in the 
following way: Sw(w) = J°° Kw{w,$,)L^o(^)d$, where W G {X, T} and 
w G {x,t}. The constants C and K which define the asymptotic, large argu- 
ment behavior read C = l/(27r/i)r(l + n) ean((irn/2))B(l/2, {ji + l)/2), Z? = 
l/(7rp!)r(l + /x) sin((7r/x/2)) J B(p + 1, /i) and £ = Dp/ + p)B(p + 1, /*)) 

and that of price returns Sx(x) takes the form: 

- hh V-mtty = s/^w (' " ©") * (17) 

For both cases considered above the cumulative density of either waiting times 
of price returns takes the form: Sw(w) = i\V(tu, 0-k/*,o(0^£ where VF G 
{X, T} and u> G {x, £} and the kernels l\V(w, depend on the function f(x, t) 
and on the number of stocks p. The functional form of Sw(w) allows us to 
work out an asymptotic expansion of the cumulative density of waiting times 

Sr(t) = — E — T\ — B (P + 1, MO r(l + ilk) sin(7rM/2)— (18) 



and that of price returns: 

«*) = ^ g '"T' J + p) rd + **) sin(^/2)-L (19) 

The results are summarized in Table 1. 

Notice that it is not possible to derive a small w expansion of Sw(w) around 
w = even though a Taylor expansion of the Levy function L^o(x) around 
x = exists for certain values of the parameter \i. Indeed Sw(w) cannot be 
expanded around w = because all its derivatives at w = do not exist. 
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Firstly, for the case / = \J x 2 + t 2 we get from equation (9) 



dS T (t)/dt = -<p T (t) = f <f> x , T (x,t)dx = f- ^Xl^ Ux (20) 



and see that the integral on the right-hand side of (20) diverges for t — since 
it contains an non-integrable 1/x singularity at x — 0. The same happens for 
the case in the second row in Table 1. 



2.2 The evolution equation 



We now work out an evolution equation for p(x,t) = P(X t = x) ie the prob- 
ability that the stochastic process X t which describes t he time dependence o f 
price returns has value x at time t. We know |Montrolll (|l965h : lBalescul (|l997h ] 



that p(k,s), the Fourier transform of p(x, t) with respect to x and a Laplace 
transform with respect to time t, takes the form: 



p(k,s)= ^ . (21) 

W(M 



where 



(j)(k, s) = F x [L t [(/>x,t( x M 
}(s) = L t [Mt)] 



[22) 



Here the variables in parenthesis (x, k) and (t, s) are mutually conjugate and 
F x , L t are Fourier-Laplace transforms (FLTs) with respect to x and t respec- 
tively. 

The evolution equati on is worked out by taking inverse FLTs of (21) (compare 
with equation (1) in Sabatelli ( 2002j ) and takes the form : 



J $(t - t')-^P( x , t')dt' = -p(x, t)+ J J X(x - x', t - t')p(x', t')dx'dt' '(23) 



t oo 



-oo 



where the kernel $(t) is related in Laplace space to the cumulative probability 
density S(s) = L t or to the probability density <fi(s) by: 

Hs) = * {S ) = 1 (24) 
1 — sS(s) scj)(s) 
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and the kernel \(x, t) is related in Laplace- Fourier space to both the joint pdf 
and to the pdf of waiting times by: 

A(M) = ^ (25) 
0(s) 



In general the process is non-Markovian, ie the present value of p(x, t) depends 
on the whole history of both p and its time derivative ^jp(x, t'), unless the time 
dependence of the kernels reduces to a Dirac delta function. To demonstrate 
the non-Markovian behavior for our theory all we need to do is show that 
$(s) 7^ 1 in (24) or equivalently that 0(s) ^ 1/(1 + s). Then: 



fa) = ±J eMst} j -j0jzd£dt =±J L M! o(£K / dt e -^^(2Q) 

t 

oo tt/2 

= -J LpfliOdt J exp{-(sOsm6}d9 (27) 
o o 

t/2 oo 

= - j J L^ (0exp{-(s0sm9}d^de (28) 


tt/2 

= - / exp{-(s sin ey}dO (29) 

1 - (-ir r((/m + l)/2) 1 
" V^i n! r(l + F /2) ^ 1 + s ( j 

The transition from (26) to (27) follows from a substitution t — £ sin and 
equality of terms (28) and (29) follows from the integral representation of the 
Levy function and from applying the Cauchy integral theorem. 

Note that the representation of the joint pdf in Laplace-Fourier space <fi(k, s) 
can also be calculated explicitly: 



4>{k ) s)= j j — exp{ikx — st} — — — dtdx (31) 

J J TV J x 2 + t 2 

-oo 

= —J J L^ (r) exp [— {s cos — i sin <p}r] drd<p (32) 

^ 

tt/2 

2 f 

— — exp{ — (scos0 — ik sin (p)^}d(f) (33) 

TV J 
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tt/2 

2 



7T 



y exp{ — (ssin0 — ik cos (f)Y}d(j) (34) 
o 



Equality (31) = (32) follows from going to polar coordinates (x, t) — > (r sin (ft, r cos i 
and equality (32) = (33) follows from the same transformation as that leading 
from (28) to (29). Finally the last equality (33) = (34) follows from substi- 
tution of variables <fi < — 7r/2 — <fi. Equations (25), (29) and (34) allow us 
to draw the conclusion that the kernel X(k, s) in Laplace-Fourier space does 
not factorize and therefore the evolution equation (23) includes a term on the 
right-hand-side (the double integral over x' and t') that is non-local both in 
ti me and in space . This turns out to be different from the evolution equation 



in 



Sabatelli (2002) where the respective term was non-local in space only. 



Having obtained results (29) and (34) we can also look at the form of the 
evolution equation in the limit of large price returns x and large waiting times 
t. It only amounts to finding the form of the joint pdf 4>(k, s) and the waiting 
time pdf <p(s) for small values of k and s and inserting them into the evolution 
equation (21) in Laplace-Fourier space. Expanding the exponential in (29) in 
a power series and integrating term by term we get: 

4>(s) = 1 - + 0{s 2 ^) where a x = B f|, -) - (35) 



7T 



and from (34) we obtain: 



2 f 8 s 2 \ 

l--y (s sin (f>-ik cos 1- fcM ft + J (36) 



where 



Pi = B 



2 1 2) 7i 



71 

li- 1 3\ -e~" m l 2 ^ 



ft = /*0* - l )/ 2B ' n ( 37 ) 



7T 



Inserting (35) and (36) into the evolution equation (21) yields an equation 

^ [ft + I3 2 S - + ft^ j p(fc, S ) = Qt!^" 1 (3J 
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that can be reformulated in terms of fractional derivatives Samkol ( 1993f ) in the 
x, t space, ie after performing an inverse Laplace-Fourier trans form. Here we 
merely note that the equation is different than that obtained in Scalasl (2000) 
which is not a surprise since we dropped the assumption of independence of 
x's and £'s. 



3 Empirical analysis 



We now analyze two sets of stock market data, fit the theory developed in 
section 2.1 to the data and finally discuss the goodness of the fit. 

The first data set comprises prices for 10 stocks(5 railways and 5 banks) from 
the Irish stock exchange during the period 1850-54. Transactions during this 
period were carried on a daily basis so all waiting times are a day or longer. 
Over the 5 year period we have 1751 data points. 

The second set contains 2220695 exchange rate quotes of US$ to the Japanese 
Yen taken between 1989 to 1998 with waiting times ranging from a minute to 
some hours. The stock prices fluctuate with a much higher frequency than in 
the former case and the statistical error is much smaller due to a much larger 
size of the data sample. This allows a more reliable fit. 



3.1 Cumulative distributions of price returns and waiting times 



The objective is to fit theoretical cumulative density functions Sx{t) (12,16) 
and Sx{x) (17) to market data 



Nx(t) := number of data points with waiting times not smaller than t 
Nx(x) := number of data points with price returns not smaller than x 

We consider both assumptions from Table 1 regarding the way returns and 
waiting times are correlated, assumptions which result in two different formu- 
las for Sxit). However it turns out that there is no great difference between 
them. Our task consists in choosing three parameters v, w and /i in such a 
way that the chi-square test statistics Xr an d Xx are as small as possible. 



Xx ■= Yl 



(Nj<(ti) / Nt{0) — Sxivti)) 2 
ti S T (vti) 
(iV x (x 4 )/iV x (0)-5 T (^)) 2 



S T (wXi) 



(39) 
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where the sums run over the measured waiting times ti and price returns 
Xi for % = 1, . . . ,n. For our data sets we chose n = 200. Parameter values 
can be estimated by fitting a power law S T (t) = (cut) 61 for t G (t u t 2 ) and 
S x (x) = (a 2 x) b2 for x G (xi,x 2 ) to the asymptotic behavior of distribution 
functions. The ranges ti < t < t 2 and xi < x < x 2 have to be chosen with 
caution. The variables t and x have to be large enough for the power law to be 
valid but cannot be too large because the finite size of the data file introduces 
a significant error to N T (t) and N x (x). 

For the high frequency US$/Yen data we have fitted following power laws to 
the distribution functions: 



S T (t) ~ ( ai t) bl = (0.813t)~ L8662 for 50 < t[min] < 300 (40) 

S x (x) ~ (a 2 xf 2 = (0.451a;) ~ L8662 for 50 < 50000a; < 300 (41) 

Similarly the asymptotic fits for the low frequency old Irish data read: 

S T (t) ~ (ait) 61 = (1.099t)~ a42 for 30 < t[days] < 160 (42) 

S x (x) ~ {a 2 xf 2 = (0.669a;) ~ a42 for 30 < 1000a; < 160 (43) 



with the relative error of the coefficients not larger than a couple of percent in 
both cases. It has to be emphasized that the fit to Sx(x) is biased in the sense 
that the power law exponent has to have the same value as in Sxit) for the 
theory to be valid. Indeed, the parameters to be fitted are expressed by the 
measured parameters ai,b\ of the asymptotic behavior in the following way 
(compare to formulas in Table 1): 

v = ai - C 1 ^ w = a 2 - C 1 ^ and // = h = b 2 (44) 

where C = l/(27iyi)r(l + y) sm (irfi/2)B (1/2, (1 + /i)/2) 
for the case f(x, t) = y/x 2 + t 2 and 

v = ai- D 1 ^ w = a 2 - E 1 ^ and ^ = b x = b 2 (45) 

where D = 2/nT(l + fi) sin(7r/x/2)S (ji, P + 1) and 
E = 2/tiT(1 + n) sin(7r/i/2)P/(/i(/i + P)) 
for the case f(x,t) = x + t. 

Recall that B is the beta function defined in terms of the gamma function as 
B(n,m) = T{n)T{m)/T{n + m). 

From formulas (43), (44) and (45) we obtain values of parameters fi = 1.8662, 
v = 1649(0. 1072), w = 0.0914(0.1044) for the US$/Yen data for the cases 
/ = y/x 2 + t 2 and / = x + t respectively. For the old Irish data we analyzed 
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the first case / = \/x 2 + t 2 only and obtained the following results: /i = 0.42, 
v = 0.2223, w = 0.1354. The results are shown in Figs. 2,3,4,5. Firstly we no- 
tice that the assumption regarding the choice of function f(x, t) does not play 
a significant role. The relative difference of the cumulative probability densities 
does not exceed 0.4 in the US$/Yen case. The price returns distribution fit is 
better than that of waiting times. The x 2 test statistics Xx = 0.497(1.974) are 
smaller than that for waiting times Xr = 3.3223(2.4694) for the same number 
of degrees of freedom 304 in both cases. This does not however discredit the 
model. The US$/Yen waiting times data seem not to exhibit the typical fea- 
tures of probability distributions of financial data, ie exponential and power 
law decay for small and large waiting times respectively. Indeed, an inspection 
of Fig. 3 reveals mysterious humps at t ~ 100 and t ~ 350. This could imply 
that small waiting times are underestimated and large waiting times are over- 
estimated. This could be due to a limited (two decimal digits only) accuracy 
of the data. Such data truncation will indeed introduce a bias toward larger 
waiting times. More accurate data is required to test this suggestion. 

At this point we note in more details the difficulties arising by computing 
Sr(t) from formulas (12) and (13). 

Since a series expansion in t around t — does not exist (as proved in section 
2.1) we computed the cumulative density SV(i) numerically using a Gauss- 
Hermite quadrature. The integrand in (12) behaves like a Gaussian for large 
values of z and the integration range is infinite therefore the Gauss-Hermite 
quadrature is best suited for all integration procedures. However, since the 
integrand is cut off for z < vt we use a large number of abscissas Zi and 
weights Wi in the quadrature 



so that the contribution from the integrand for z > vt is considerable. In 
other words the number M of abscissas has to be large enough so that the 
largest of abscissas z max = maxi Zi is much larger than vt, which, for our 
data, is of the order of v * 200 = 44.5. Furthermore since the weights Wi 
decrease quite strongly with increasing Xj's we have to carry out calculations 
with high precision. We chose M = 3500 which yields z max = 83.2 and the 
corresponding weight w max ~ 7 * 10~ 3011 and conducted computation with 
a precision of 300 digits. This method is however only suited for calculating 
<St(£) for intermediate times and computing the cumulative density for the 
largest waiting times 100 < t < 500 is marked with a considerable error. 

The Gauss-Hermite computation works quite well in the whole range of t 
values for the case of US$/Yen exchange rates, Figs. 2,3. This is because the 



oo 



M 




(46) 
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asymptotic expansion (13), which was derived from an asymptotic expansion 
of the Levy function converg es for all t > for < fx < 1 and diverges for all t 
if 1 < fx < 2 iMontrollI (|l984h . For the old Irish stock data where fx = 0.42 < 1 
the asymptotic expansion (13) diverges. It is clearly difficult to work out a 
numerical procedure for computing St(£) in the whole range of ft parameter 
< fi < 2. 



3.2 Goodness of the fit 



We know that the variables zt and z x conform to a Xn-i distribution with n—1 
degrees of freedom. Since for large n the variable Xn can be approximated by 
the normal distribution Normal(n, 2n) with mean n and variance 2n, we see 
that for n = 200 the computed values zt = 2.70527 z x = 0.910386 are much 
smaller than the 2.5%-value of the distribution which is 200 + 1.96 * V400 = 
239.2, which yields a satisfactory conformance to the theory. 



Here a comment has to be made on the waiting time data (Fig. 3). Unlike 
the cumulative pdfs fr om previ o us pa pers, eg. the New York Stock Exchange 
prices investigated by Rabertol f 2002^ . that are well fitted by an exponential 
for small values of t and by a power law for large t values the cumulative pdfs 
for the US$/Yen data seem to exhibit a power law behavior in the whole range 
of t values and in addition show small humps or shoulders for t > 100. In this 
respect it is not a surprise that the function (12) does not fit the data very 
well and however neither did the approach developed in ISabatellil (Effll. At 
least using our approach the price returns cumulative pdfs are quite well fitted 
with the same parameter fi as that used in the waiting time data. 



3. 3 Correlation between price returns and waiting times 



We now analyze correlations between different functions of waiting times and 
price returns for the market data and try to justify the assumption made in 
section 2.1 regarding the choice of function f(x,t). 

The main objective of this article was to show that price returns x and waiting 
times t cannot be regarded as independent variables and to find such functions 
of x and t, which, within the accuracy of data available can be regarded as 
independent. 

Indeed a standard contingency test between x and t carried out in Table 2 
shows that the null hypothesis, of the variables being independent can be 
rejected at a 1% confidence level. 
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This begs a question whether there is a mapping of x and t onto new vari- 
ables that can be considered as independent. In section 2.1 we considered the 
mapping (3) and we assumed all functions 0j in (4) to be equal to one. This 
is equivalent to claiming that the variables /, 9±, . . . , 9 P are independent while 
the original x±, . . . ,x p are not. Let us now check this assumption numerically 
making use of the market data. 

Consider the case p — 1 in (3), ie a case of only one stock, and set up a 
contingency table between r = yj (to) 2 + (vt) 2 and = arctan (vt) / (wx) in 
Tables 4 and 5. 

The prices of stocks were given with an accuracy of two decimal digits, which 
allows computation of the error 

dx = dlogp n+1 /p n = dp n+1 /p n+1 + dp n /p n ~ 0.01/ < p n >~ 10~ 4 (47) 



From this information and from the range in which waiting times and log- 
price returns vary 1 < t < 2606 and 10~ 9 < x < 1.910~ 2 we calculate the 
accuracy r ± dr and ± d(p of random variables r = J (wx) 2 + (ft) 2 and 



= arctan (vt) / (wx) , the mutual independence of which we are going to 
investigate. Total differentials of r and yield: 



dr = |sin(0)| vdt + |cos(0)| wdx 



d(f) 



COS(0) 



vdt + 



sin(0) 



wdx 



(48) 



We calculate the differentials dr and d<p for r, values from Table (4) and 
estimate the errors 5r and 50 by the average value of the corresponding differ- 
ential. This yields a result 5r = 0.165 and 50 = 0.002. If we compare this with 
the range of values in each box of the Table (4), we get r m+ i — r m = 0.206 
and n+ i — n = 7r/20.0008 = 0.0012 which is on the verge of the accuracy of 
measurements. 

Since the Table (4) contains many zeros in the upper and lower right corners 
it would be reasonable to refine the range of variability, introducing more rows 
in the center of the table where most data points are gathered and assemble 
a couple of top rows and a couple of bottom rows into single rows. This 
would however yield a refinement much below the measurement error unless 
the number of rows and thus the number of degrees of freedom was decreased 
resulting in a less certain estimate. 

Indeed, a contingency test in Table (5) with the same number of rows and 
columns but with a refined range of the variable = tt/2(1 + 8(n — 5)10~ 4 ) 
yields a much larger value of the test statistic z = 1507.7 than that from 
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Table (4) suggesting the null hypothesis should be rejected. However, since 
the values are marked with an error 5<p = 0.002, which is one order of 
magnitude larger than the increment 4> n+ i — 4> n = n/2 ■ 8 • 10 -4 such a test is 
not reliable. 

The accuracy of measurements is not an issue for a contingency test between 
the log-returns x and the waiting times t themselves. 



4 Conclusions 

We have developed a new generalization of the continuous time random walk 
that recognizes waiting times and associated jumps may not be independent. 
The theory has been applied to stock price data. Two data sets were used 
namely old 19th century Irish daily data and 20th century US$/Japanese Yen 
currency data. Since our theory avails of analytical results it would be inter- 
esting to pursue further investigation of correlations between waiting times 
T and price jumps X. Note that averages of the kind E [X n T m ] are read- 
ily expressed through derivatives of the joint pdf in Fourier-Laplace space 
d%d™<p(k, s). In particular, we see from (37) that E [X»] ~ f3i, E [X^T] ~ fa 
and E [X^ 2 T 2 ] ~ [3 3 . Once more accurate data was available these results can 
be used to validate the theory. 
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t = n * 10 
[min] 


X = -0.005 + 


0.0025 * m 










< 1 


1-2 


2-3 


3-4 


> 4 


— 


0-1 


439(560.569) 


1103725(1. 10344c+06) 


1102567(1.10263o+06) 


364(442.292) 


91(111.318) 


2207186 


1-2 


19( 2.579) 


4918(5078.31) 


5209(5074.56) 


10( 2.035) 


2( 0.512) 


10158 


2-3 


9( 0.398) 


753(784.893) 


802(784.313) 


5( 0.314) 


1( 0.079) 


1570 


3-4 


7( 0.141) 


294(279.462) 


251(279.255) 


5( 0.112) 


2( 0.028) 


559 


4-5 


11( 0.066) 


112(130.982) 


130(130.885) 


9( 0.052) 


0( 0.013) 


262 


5-6 


4( 0.046) 


88( 91.987) 


86( 91.919) 


4( 0.036) 


2( 0.009) 


184 


6-7 


5( 0.032) 


63( 62.991) 


54( 62.944) 


4( 0.025) 


0( 0.006) 


126 


7-8 


0( 0.017) 


28( 33.995) 


36( 33.97) 


4( 0.013) 


0( 0.003) 


68 


8-9 


3( 0.011) 


15( 21.997) 


22( 21.98) 


4( 0.008) 


0( 0.002) 


44 


9-10 


1( 0.013) 


20( 26.496) 


29( 26.476) 


2( 0.01) 


1( 0.002) 


53 


> 10 


66( 0.123) 


181(242.467) 


191(242.288) 


34( 0.097) 


13( 0.024) 


485 


v.. 


564 


1110197 


1109377 


445 


112 


2220695 



Table 2 



Contingency table between x = \og{p n+ \/p n ) and t for the US$/Yen data. The 
numbers in brackets are the expected frequencies under the assumption that x and 
t are independent variables. The test statistics z = Yli(Oi~Ei) 2 / 'Ei ~ xlo = 65619.1 
is clearly above the 2.5%-value of the distribution which is xlo 2 5% = 40 + 1.906 * 
^80 = 57.0478 ' 



t = n * 10 


r = -0.125 


f 0.0625 * m 










[days] 


< 1 


1-2 


2-3 


3-4 


> 4 


^ j t 


0-1 


5( 17.084) 


469(482.644) 


738(697.627) 


6( 14.949) 


0( 5.694) 


1218 


1-2 


2( 3.394) 


105( 95.894) 


130(138.609) 


4( 2.97) 


1( 1.131) 


242 


2-3 


5( 1.683) 


49( 47.551) 


58( 68.731) 


6( 1.472) 


2( 0.561) 


120 


3-4 


3( 0.673) 


17( 19.02) 


23( 27.492) 


4( 0.589) 


1( 0.224) 


48 


4-5 


1( 0.378) 


12( 10.699) 


13( 15.464) 


1( 0.331) 


0( 0.126) 


27 


5-6 


0( 0.182) 


8( 5.151) 


4( 7.445) 


0( 0.159) 


1( 0.06) 


13 


6-7 


2( 0.154) 


5( 4.358) 


3( 6.3) 


0( 0.135) 


1( 0.051) 


11 


7-8 


1( 0.098) 


2( 2.773) 


4( 4.009) 


0( 0.085) 


0( 0.032) 


7 


8-9 


1( 0.098) 


4( 2.773) 


2( 4.009) 


0( 0.085) 


0( 0.032) 


7 


9 - 10 


0( 0.056) 


2( 1.585) 


1( 2.291) 


0( 0.049) 


1( 0.018) 


4 


> 10 


4( 0.196) 


5( 5.547) 


4( 8.018) 


0( 0.171) 


1( 0.065) 


14 


v.. 


24 


678 


980 


21 


8 


1711 



t 



Table 3 

The same as in Table 2 for the old Irish data. The test statistics z = 304.798 is clearly 
above the 2.5%-value of the distribution which is X40 2 5% = 40 + 1.906*\/80 = 57.05 
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r = 0.206 * m 



f (1 + 8(n - 5)10" 4 ) 


< 1 


1-2 


2-3 


3-4 


> 4 


— 1. v 


0-1 


6( 4.937) 


0( 0.612) 


0( 0.201) 


0( 0.14) 


0( 0.108) 


6 


1-2 


17( 13.988) 


0( 1.735) 


0( 0.569) 


0( 0.397) 


0( 0.308) 


17 


2-3 


33( 27.154) 


0( 3.369) 


0( 1.105) 


0( 0.77) 


0( 0.599) 


33 


3-4 


337(307.752) 


29( 38.188) 


7( 12.53) 


0( 8.736) 


1( 6.792) 


374 


4-5 


910745( 912974) 


114348( 113289) 


37462(37172.2) 


26331(25917.3) 


20618(20151.2) 


1109504 


5-6 


915832( 913687) 


112353( 113378) 


36929(37201.3) 


25542(25937.5) 


19714(20166.9) 


1110370 


6-7 


311(274.837) 


19( 34.104) 


3( 11.19) 


1( 7.802) 


0( 6.066) 


334 


7-8 


37( 32.091) 


2( 3.982) 


0( 1.306) 


0( 0.911) 


0( 0.708) 


39 


8-9 


16( 13.165) 


0( 1.633) 


0( 0.536) 


0( 0.373) 


0( 0.29) 


16 


9-10 


0( 0) 


0( 0) 


0( 0) 


0( 0) 


0( 0) 





> 10 


2( 1.645) 


0( 0.204) 


0( 0.067) 


0( 0.046) 


0( 0.036) 


2 


v.. 


1827336 


226751 


74401 


51874 


40333 


2220695 



Table 4 

Contingency table between r = \J (wx) 2 + (vt) 2 and <fi = arctan (vt) / (wx) for the 
US $/Yen data. The numbers in brackets are the expected frequencies under the 
assumption that r and <j) are independent variables. The null hypothesis cannot 
stand but the test statistics z = ~ Ei) 2 /Ei ~ xlo = 138.646 > 40 + 1.906 * 
\/80 = 57.048 has a much smaller value than in Table 1 



<t> = £(1 + (n - 5)410~ 4 ) r = 0.206 *m 





< i 


1-2 


2-3 


3-4 


> 4 




- 1 


56( 46.08) 


0( 5.718) 


0( 1.876) 


0( 1.308) 


0( 1.017) 


56 


1-2 


68( 66.652) 


13( 8.27) 


0( 2.713) 


0( 1.892) 


0( 1.471) 


81 


2-3 


269(241.099) 


16( 29.917) 


7( 9.816) 


0( 6.844) 


1( 5.321) 


293 


3-4 


4071(3419.83) 


71(424.361) 


6( 139.24) 


5( 97.081) 


3( 75.482) 


4156 


4-5 


906674( 909554) 


114277( 112865) 


37456( 37033) 


26326(25820.2) 


20615(20075.7) 


1105348 


5-6 


911819( 910295) 


112267( 112957) 


36920(37063.2) 


25534(25841.3) 


19709(20092.1) 


1106249 


6-7 


4013(3391.03) 


86(420.787) 


9(138.067) 


8( 96.263) 


5( 74.846) 


4121 


7-8 


238(207.362) 


10( 25.731) 


3( 8.442) 


1( 5.886) 


0( 4.576) 


252 


8-9 


73( 67.475) 


9( 8.372) 


0( 2.747) 


0( 1.915) 


0( 1.489) 


82 


9-10 


28( 23.04) 


0( 2.859) 


0( 0.938) 


0( 0.654) 


0( 0.508) 


28 


> 10 


27( 23.863) 


2( 2.961) 


0( 0.971) 


0( 0.677) 


0( 0.526) 


29 


T.. 


1827336 


226751 


74401 


51874 


40333 


2220695 



t 



Table 5 

The same as in Table 4 only for a refined range in <f>. Now the test statistic is larger 
than before z = 1507.75 but the amount by which <j> increases d<\> ~ 4 * 10 -4 is on 
the verge of the measurement accuracy. 
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Distribution of changes of logarithms of prices. 

Japnese Yen/US Dollar exchange rates from 1989 to 1999 




x = 1000*ldpl 



Fig. 2. The distribution of changes of logarithms of US$/ Japanese Yen exchange 
rates and the fitted theoretical formulas Sy(x) and S\(x) corresponding to the 
case in first and second row in Table 1 respectively. The test statistic equals 
Xx = 0.497(1.974) for 304 degrees of freedom. The inset shows the relative dif- 
ference between S x and S\. 

Waiting time distribution 

Japanese Yen/US Dollar exchange rates from 1989 to 199 




waiting time t [minutes] 
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Fig. 3. Like in Fig. 2 but for waiting times. The test statistic equals 
Xt = 3.3223(2.4694) for 304 degrees of freedom. 



Distribution of changes of logarithms of prices. 

Ensemble of old Irish stocks( 1 850-4) 




x = 1000*ldpl 



Fig. 4. The distribution of changes of logarithms of old Irish stocks and the theoret- 
ical curve S x (x). The test statistic equals x\ = 4.666 for 200 degrees of freedom. 
A large deviation at small x is due to the difficulty of computing S x (x) for small 
x. 




Fig. 5. Like in Fig. 4 but for waiting times. The test statistic equals Xt = 1-7595 
for 200 degrees of freedom. 



